Super Paramagnetic Clustering of DNA Sequences
Internal identifier: 002D87 (Main/Exploration); previous: 002D86; next: 002D88
Authors: Sugiarto Radjiman [Singapore]; Han Lianyi [Singapore]; Wang Jian-Sheng [Singapore]; Chen Yu Zong [Singapore]
Source:
- Journal of Biological Physics [ 0092-0606 ] ; 2006-01-01.
English descriptors
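- KwdEn: DNA sequence, cluster algorithm, data clustering, promoters, statistical physics, transcription factor binding sites.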
Abstract
Abstract: An unsupervised clustering of 4541 DNA sequences containing active promoter regions from vertebrate and arthropod classes (including their viral genes) was performed. All necessary information was solely gathered a priori from the DNA sequences by measuring frequencies of tri-nucleotides and tetra-nucleotides. We employed Super Paramagnetic Clustering, a novel clustering algorithm based on physical properties of an inhomogeneous granular ferromagnet. This method utilizes Swendsen-Wang cluster Monte Carlo simulations to distinguish clusters by measuring pairs of correlation functions from different resolutions. We identified two strongly separated clusters of human viral genes corresponding to the Epstein-Barr virus and the Herpes Simplex virus type 1. In addition, vertebrate and arthropod sequences were successfully separated into two different classes with merely 9.25% of arthropod sequences being misclassified. From a functional perspective, these sequences have high gene function correlations with sequences from the vertebrate cluster. By tuning a clustering parameter, Super Paramagnetic Clustering was able to classify vertebrate class further into two major clusters, from where a large number of housekeeping genes and tissue-specific genes were found respectively. The indications came from observation of gene expression function and consensus transcription factors which were found grouped together in specific positions of the DNA sequences.
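The abstract states that the clustering input was built from tri-nucleotide and tetra-nucleotide frequencies measured on each sequence. The following is a minimal sketch of such a feature extraction, not the authors' implementation: the function names `kmer_frequencies` and `feature_vector`, the use of overlapping windows, the normalization by the number of valid windows, and the skipping of windows with ambiguous bases are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(sequence, k):
    """Return relative frequencies of all 4**k k-mers over the ACGT alphabet.

    Assumption: overlapping windows, normalized by the number of valid windows.
    """
    seq = sequence.upper()
    alphabet = set("ACGT")
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= alphabet  # skip windows with ambiguous bases
    )
    total = sum(counts.values())
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    return {kmer: (counts[kmer] / total if total else 0.0) for kmer in kmers}


def feature_vector(sequence):
    """Concatenate tri- and tetra-nucleotide frequencies (64 + 256 = 320 values)."""
    tri = kmer_frequencies(sequence, 3)
    tetra = kmer_frequencies(sequence, 4)
    return [tri[m] for m in sorted(tri)] + [tetra[m] for m in sorted(tetra)]
```

Each sequence then maps to a fixed-length vector of 320 values, so a distance between any two sequences can be computed regardless of their lengths. How the paper turns these frequencies into the interaction strengths used by Super Paramagnetic Clustering is not specified in the abstract and is not covered by this sketch.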
Url:
DOI: 10.1007/s10867-006-2120-0
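Affiliations: Department of Computational Science, National University of Singapore, 117543, Singapore (all authors)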
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000688
- to stream Istex, to step Curation: 000688
- to stream Istex, to step Checkpoint: 000962
- to stream Main, to step Merge: 002E17
- to stream Main, to step Curation: 002D87
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Super Paramagnetic Clustering of DNA Sequences</title>
<author><name sortKey="Radjiman, Sugiarto" sort="Radjiman, Sugiarto" uniqKey="Radjiman S" first="Sugiarto" last="Radjiman">Sugiarto Radjiman</name>
</author>
<author><name sortKey="Lianyi, Han" sort="Lianyi, Han" uniqKey="Lianyi H" first="Han" last="Lianyi">Han Lianyi</name>
</author>
<author><name sortKey="Jian Sheng, Wang" sort="Jian Sheng, Wang" uniqKey="Jian Sheng W" first="Wang" last="Jian-Sheng">Wang Jian-Sheng</name>
</author>
<author><name sortKey="Zong, Chen Yu" sort="Zong, Chen Yu" uniqKey="Zong C" first="Chen Yu" last="Zong">Chen Yu Zong</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:26E3AE23957EF24AC44D870CB1866BB4F49C3740</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1007/s10867-006-2120-0</idno>
<idno type="url">https://api.istex.fr/ark:/67375/VQC-BLW8RPP5-1/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000688</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000688</idno>
<idno type="wicri:Area/Istex/Curation">000688</idno>
<idno type="wicri:Area/Istex/Checkpoint">000962</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000962</idno>
<idno type="wicri:doubleKey">0092-0606:2006:Radjiman S:super:paramagnetic:clustering</idno>
<idno type="wicri:Area/Main/Merge">002E17</idno>
<idno type="wicri:Area/Main/Curation">002D87</idno>
<idno type="wicri:Area/Main/Exploration">002D87</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Super Paramagnetic Clustering of DNA Sequences</title>
<author><name sortKey="Radjiman, Sugiarto" sort="Radjiman, Sugiarto" uniqKey="Radjiman S" first="Sugiarto" last="Radjiman">Sugiarto Radjiman</name>
<affiliation wicri:level="4"><country xml:lang="fr">Singapour</country>
<wicri:regionArea>Department of Computational Science, National University of Singapore, 117543, Singapore</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Singapour</country>
</affiliation>
</author>
<author><name sortKey="Lianyi, Han" sort="Lianyi, Han" uniqKey="Lianyi H" first="Han" last="Lianyi">Han Lianyi</name>
<affiliation wicri:level="4"><country xml:lang="fr">Singapour</country>
<wicri:regionArea>Department of Computational Science, National University of Singapore, 117543, Singapore</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
<author><name sortKey="Jian Sheng, Wang" sort="Jian Sheng, Wang" uniqKey="Jian Sheng W" first="Wang" last="Jian-Sheng">Wang Jian-Sheng</name>
<affiliation wicri:level="4"><country xml:lang="fr">Singapour</country>
<wicri:regionArea>Department of Computational Science, National University of Singapore, 117543, Singapore</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
<author><name sortKey="Zong, Chen Yu" sort="Zong, Chen Yu" uniqKey="Zong C" first="Chen Yu" last="Zong">Chen Yu Zong</name>
<affiliation wicri:level="4"><country xml:lang="fr">Singapour</country>
<wicri:regionArea>Department of Computational Science, National University of Singapore, 117543, Singapore</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Journal of Biological Physics</title>
<title level="j" type="abbrev">J Biol Phys</title>
<idno type="ISSN">0092-0606</idno>
<idno type="eISSN">1573-0689</idno>
<imprint><publisher>Kluwer Academic Publishers</publisher>
<pubPlace>Dordrecht</pubPlace>
<date type="published" when="2006-01-01">2006-01-01</date>
<biblScope unit="volume">32</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="11">11</biblScope>
<biblScope unit="page" to="25">25</biblScope>
</imprint>
<idno type="ISSN">0092-0606</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0092-0606</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>DNA sequence</term>
<term>cluster algorithm</term>
<term>data clustering</term>
<term>promoters</term>
<term>statistical physics</term>
<term>transcription factor binding sites</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: An unsupervised clustering of 4541 DNA sequences containing active promoter regions from vertebrate and arthropod classes (including their viral genes) was performed. All necessary information was solely gathered a priori from the DNA sequences by measuring frequencies of tri-nucleotides and tetra-nucleotides. We employed Super Paramagnetic Clustering, a novel clustering algorithm based on physical properties of an inhomogeneous granular ferromagnet. This method utilizes Swendsen-Wang cluster Monte Carlo simulations to distinguish clusters by measuring pairs of correlation functions from different resolutions. We identified two strongly separated clusters of human viral genes corresponding to the Epstein-Barr virus and the Herpes Simplex virus type 1. In addition, vertebrate and arthropod sequences were successfully separated into two different classes with merely 9.25% of arthropod sequences being misclassified. From a functional perspective, these sequences have high gene function correlations with sequences from the vertebrate cluster. By tuning a clustering parameter, Super Paramagnetic Clustering was able to classify vertebrate class further into two major clusters, from where a large number of housekeeping genes and tissue-specific genes were found respectively. The indications came from observation of gene expression function and consensus transcription factors which were found grouped together in specific positions of the DNA sequences.</div>
</front>
</TEI>
<affiliations><list><country><li>Singapour</li>
</country>
<orgName><li>Université nationale de Singapour</li>
</orgName>
</list>
<tree><country name="Singapour"><noRegion><name sortKey="Radjiman, Sugiarto" sort="Radjiman, Sugiarto" uniqKey="Radjiman S" first="Sugiarto" last="Radjiman">Sugiarto Radjiman</name>
</noRegion>
<name sortKey="Jian Sheng, Wang" sort="Jian Sheng, Wang" uniqKey="Jian Sheng W" first="Wang" last="Jian-Sheng">Wang Jian-Sheng</name>
<name sortKey="Lianyi, Han" sort="Lianyi, Han" uniqKey="Lianyi H" first="Han" last="Lianyi">Han Lianyi</name>
<name sortKey="Radjiman, Sugiarto" sort="Radjiman, Sugiarto" uniqKey="Radjiman S" first="Sugiarto" last="Radjiman">Sugiarto Radjiman</name>
<name sortKey="Zong, Chen Yu" sort="Zong, Chen Yu" uniqKey="Zong C" first="Chen Yu" last="Zong">Chen Yu Zong</name>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002D87 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002D87 | SxmlIndent | more
To add a link to this page within the Wicri network
{{Explor lien |wiki= Sante |area= MersV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:26E3AE23957EF24AC44D870CB1866BB4F49C3740 |texte= Super Paramagnetic Clustering of DNA Sequences }}
This area was generated with Dilib version V0.6.33.